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1.  Introduction 


Traditional  tests  of  hypotheses  are  special  cases  of  multiple  decision  procedures.  Usu¬ 
ally,  they  axe  restricted  to  two  decisions.  These  testing  methods  emphasize  on  a  confir¬ 
matory  approach.  The  hypotheses  are  specified  before  the  data  are  collected.  However,  it 
is  difficult  sometimes  to  specify  appropriate  hypotheses  of  interest  based  on  past  experi¬ 
ence.  We  take  a  data  analysis  approach  to  the  problem  of  decision  making  (see  Gupta  and 
Huang,  1981). 

Usually,  an  experimenter  faces  the  problem  of  comparing  several  categories  or  popu¬ 
lations.  The  classical  approach  to  this  problem  is  to  test  homogeneity  (null)  hypothesis 
Ho  :  9i  =  9k,  where  9i,. ..  ,9k  are  the  unknown  values  of  the  parameter  9  for  these 

populations.  In  the  case  of  normal  populations  with  unknown  means  9i,. ..  ,9k  and  a 
common  unknown  variance  cr^ ,  the  test  can  be  carried  out  by  means  of  the  F-ratio  of  the 
analysis  of  variance. 

In  this  paper,  we  formulate  above  ^-sample  problem  as  a  multiple  decision  problem  in 
analysis  of  variance  and  regression  analysis.  Among  the  early  investigators  of  procedures 
for  such  problems  are  Paulson  (1949),  Bahadur  (1950),  Bahadur  and  Robbins  (1950).  The 
formulation  of  multiple  decision  procedures  in  the  framework  of  selection  and  ranking 
procedures  has  been  generally  accomplished  by  using  either  the  indifference  zone  approach 
or  the  (random-sized)  subset  selection  approach.  The  former  approach  was  introduced 
by  Bechhofer  (1954).  Substantial  contribution  to  the  early  and  subsequent  developments 
in  the  subset  selection  theory  have  been  made  by  Gupta  starting  from  his  work  in  1956. 
We  will  be  mainly  concerned  with  multiple  decision  problems  formulated  as  selection  and 
ranking  problems  (see  Gupta  and  Panchapakesan  (1979)). 

In  this  paper,  we  discuss  inference  about  the  parameters  in  multiple  linear  regression 
model.  By  a  proper  reparametrization,  AN OVA  models  can  be  handled  by  regression  tech¬ 
niques.  As  Draper  and  Smith  (1981)  pointed  out,  it  is  useful  to  appreciate  the  connection 
between  the  two  methods  of  analysis.  We  consider  here  only  the  one-way  ANOVA  model. 

Om  interest  is  not  just  to  test  the  null  hypothesis  Hq  against  the  global  alternative. 
When  Ho  is  rejected,  we  want  to  identify  significantly  important  independent  variables 
and  also  check  the  appropriateness  (to  be  explained)  of  the  choice  of  the  variables  or  the 
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factor  levels. 


2.  Inferences  About  Regression  Parameters 
Consider  the  linear  model 


(2.1) 


where  F'  =  [Fi,  F2, . . . ,  Yn]  is  an  n  x  1  vector  of  responses,  X  =  [1,  . . . ,  is  an 

n  X  p{n  >  p)  matrix  of  known  constants  of  rank  p,  =  [/3o ,  /di , . . . ,  /3p-i]  is  a  1  x  p  vector  of 
unknown  parameters,  s  ~  N(0,  cr^In),  and  In  denotes  the  nxn  identity  matrix.  We  refer  to 
model  (2.1)  as  the  true  model  of  size  p.  From  this  model,  we  obtain  p—  1  so-called  reduced 
models,  each  of  size  p—  1,  by  dropping  one  independent  variable  at  a  time.  Let  A’(^)  denote 
the  “A  matrix”  of  the  reduced  model  obtained  by  dropping  the  independent  variable  Xm, 
m  =  1,2. ..p— 1.  Correspondingly,  we  have  the  residual  sums  of  squares  for  these  reduced 
models  denoted  by  SSp-\^rni  =  1 ...  ,p  —  1.  Accordingly,  the  residual  sum  of  squares 
for  the  true  model  is  denoted  by  55p,i.  It  is  known  that  SSp^i  =  Y^QY_  and  SSp-i^m  = 
F'Q(m)F  where  Q  =  In  -  X{X'X)X'  and  -  X(^rn)iXl^)X(^rn))~^X[^y  Under 

the  true  model  assumption,  it  is  known  that 


SSp^i 


^PA  2  j  l,m  2  11 

^2  ~  Xn-p  ^2  Xn-p,Ap_i,„, ) —  I5  •  •  •  jP  ~  Ij 


where  Ap_i^,„  =  (A’^)'^(^)(A’^)/2(7^,m  =  l,2,...,p—  l,xi  denotes  the  (central)  chi- 
square  distribution  on  u  degrees  of  freedom  with  noncentrality  parameter  A.  Thus 
E[SSpy]  =  (n  -p)cr2  and  F[55p_i,„j]  =  {n-p+l)a‘^  +2(T^Xp-i^rn,m  =  l,...,p-  1. 


We  note  that  Q{m)i''^  =  1;2, ...,p  —  1,  are  idempotent  and  symmetric;  thus  they 
are  positive  semidefinite.  Hence  \p-\^nf,'ni  —  1,2, . . .  ,p  —  1  are  nonnegative.  Obviously, 
A  =  •  •  •  =  ^p-i  =  0  implies  that  \p-i,m  =  0  for  m  =  1, . . .  ,p  —  1.  However,  the  converse 
is  not  necessarily  true.  When  ^  0,  Ap_i,;n  can  be  interpreted  as  the  contribution  of  Xm 

in  making  the  regression  significant  given  that  the  other  variables  are  already  in  the  model. 
Let 

‘S'5'p_i,m]  n-p-M 

55p,i  J  2  ’ 


Ap— l,jn  — 


n 


P 


where 


— 


SSp—i^m  ^  SSp^i 

SSp,i/in-p) 
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has  the  noncentral  i^-distribution  on  1  and  (n  —  p)  degrees  of  freedom  with  noncentrality 
parameter  denoted  by  We  note  that  the  f/m  is  the  statistic  used  in 

the  so-called  partial  F-test  for  the  significance  of  (3m- 

Now,  we  consider  our  problem  of  testing  Hq  :  ^  =  0.  The  classical  F-test  for  this 
against  the  global  alternative  Ha  :  (3  0  is  designed  only  to  control  the  probability  of 

type  I  error.  In  our  formulation,  when  Ho  is  rejected,  the  decision  also  includes  selecting 
a  subset  of  the  p  —  1  independent  variables  as  significant.  When  Ho  is  false,  a  correct 
decision  (CD)  occurs  if  Hq  is  rejected  and  the  selected  subset  of  the  independent  variables 
includes  the  variable  associated  with  the  largest  Let  A  =  [Ap_i,i, . . . ,  Ap_i,p_i]  and 

^  <  Ap_x,[p_i]  denote  the  ordered  Ap_i^„i.  We  require  that,  for  given  0  <  o:  <  1 

and  ^  <  P*  <  1, 


Pr[RejectP’o  =  Q]  <  a 

(2.2) 

and 

>  A}  >  P- 

(2.3) 

where  A  >  0  is  specified  in 

advance. 

Since  ^  =  0  implies  that  A  =  0,  (2.2)  is  satisfied  if 

Pr[IlejectPolA  =  Q]  <  o 

(2.4) 

and  (2.3)  is  equivalent  to 

Inf  PrlCD|A,_,,(,.„  >  A]  =  P*. 

(2.5) 

We  propose  a  test  of  Ho  based  on  the  statistics  SNm  =  101og77m,»n  =  1, . . .  ,p  —  1,  where 
the  log  is  to  base  10.  We  could  just  use  the  rjm,  but  the  transform  denoted  by  SNm  is 
based  on  Taguchi’s  idea  of  signal-to-noise  ratio.  Our  test  procedure  is  as  follows: 

Reject  Ho  if  SNi>  C  for  some  i.  Include  in  the  selected  subset  of  significant  variables 
all  the  variables  X^s  for  which  SNi  >  C. 

The  constant  C  should  satisfy 
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Pr{SNi  >  C  for  some  z|A  =  0}  <  a 


(2.6) 


and 


Inf  Pr{5W,p.„  >  C|Ap-,,|,.,|  >  A}  =  />•  (2.7) 

A 


where  SNp-i  denotes  the  SNi  associated  with  When  A  =  0,  SNi,  i  =  I, . . .  ,p  — 

1,  are  correlated  each  having  a  central  Fi^n-p  distributions.  By  using  the  inequality: 

P[UAj]  <  '^PiAj),  (2.6)  is  satisfied  if 
j 


p-i 

Pr{SNm  >  C\\  =  0}  =  a 

m=l 


which  gives 


Pr{Fi,n-p  <  }  =  1 - ^  =  jisay).  (2.8) 

p  —  1 

When  Ap_i  ,[p_i]  >  0, 5iV(p_i)  has  the  stochastically  increasing  property  in  terms  of  the 
noncentrality  parameter.  Thus,  (2.7)  is  satisfied  if 


Pr{Fi,n-p.,A  <  10^}  =  1  -  P*.  (2.9) 

We  obtain  an  approximate  solution  to  (2.8)  and  (2.9)  in  the  form  of  C  as  a  function  of  n 
by  using  the  following  lemma  of  Huang  (1996). 

Lemma:  Let  Y  have  the  noncentral  F  distribution  with  u,v  degrees  of  freedom,  and 
noncentrality  A,  denoted  by  Fu,v,a-  Then  A 


where 


Pr{Y  <  j/}  =  [1  +  e-l-7941482:pl 


X  — 


2(tt+2A)  ,  2  (  uy 

9(u+A)^  9v  V  u+A) 


2(«+2A) 

9(uH-A)2 


1  ^ 


(2.10) 
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The  maximum  absolute  error  in  the  approximation  in  (2.10)  is  0.06  for  v  >  5.  Also  A  =  0 
gives  the  approximation  in  the  central  case. 

Now,  let  y  =  10^,  oq  =  -  l),6o  =  1.794148^^(1^  “  =  1  “ 

gCn-pj J  B  =  1  —  ^  ~  9(i+^2-  By  using  the  above-stated  lemma  in 

equations  (2.8)  and  (2.9),  we  obtain  an  approximate  solution  to  (2.8)  and  (2.9)  from  the 
following  equations: 


Ay^  -B 


[(1-S)  +  (1-A)yi]i 


ao, 


(2.11) 


AEyh  -(1-F) 


T  =  ^0- 


[F  +  il-A)E^yi]^ 

Squaring  both  sides  of  (2.11)  and  (2.12),  and  rearranging  the  terms,  we  get 

[.4^  -  ao(l  -  A)]y^  -  2ABy^  +  -  ao{l  -  5)  =  0, 

[A'‘E'^  -  60(1  -  A)E^]yi  -  2AE{1  -  F)yi  +  (1  -  -  boF  =  0. 


(2,12) 


(2.13) 

(2.14) 


2 

We  now  eliminate  ya  from  (2.13)  and  (2.14)  to  obtain 


,  _  [B^  -  aljl  -  B)][A^E^  -  blE^l  -  A)]  -  [(1  -  Ff  -  blF][A^  -  aljl  -  A)] 

^  2AB[A‘^E'^  -  blE^{l  -  A)]  -  2AE(l  -  F)[A^  -  al{\  -  A)] 

=  G,  say. 

Thus  y  =  and  y  =  10^,  yielding  C  =  30  log  G.  (2.15) 

Remark:  While  the  data  are  used  to  test  Hq  and  make  appropriate  decision,  the 
statistics  SNm  as  signal-to-noise  ratio  tell  also  something  about  the  appropriate  choice 
of  the  independent  variables.  A  negative  value  of  SNjn  (or  equivalently  rjm  <  1)  shows 
instability  of  variance  in  estimating  /5m . 

3.  Regression  Treatment  of  One-Way  ANOVA  Model 
Consider  the  ANOVA  model 


Yjj  —  fi  0j  Sij  j 


1, 2, . . . ,  J);  i  —  1,2,...,/, 


(3.1) 
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where  £i,j.,j  =  1,2, Ji;i  =  1,2,...  I,  are  independent  and  identically  distributed  normal 
iY(0,  (T^)  random  variables  and  6i+ . .  .  +  di  =  0.  By  letting  /?,■  =  jj,  +  6i,i  =  1, . . . ,  /,  model 
(3.1)  can  be  written  as  regression  model: 

Y  =  X§_  +  e 

where  r  =  [Yn,. ..  ,Y,j,},X  =  [V„...,X,I,£  =  [0. . . .  ,0; . . . ; 

...;0,....0|,i  =  l,2 . 1,  and2'  =  [ft,.--,/)/I- 

ith  block 

Let  b  =  {bi, . . .  ,bj)  where  bi  =  Y i  =  ^  is  the  least  squares  estimate  /?i, 

j=i 

I  .  I 

i  =  1, . . .  ,1.  Since  9i  =  /Si  —  j  ^  /3j,  we  unbiasedly  estimate  by  —  j  ^  bj.  It 

is  easy  to  show  that  E{9i)  =  9i,  and  Var(^i)  =  Cja^,  where  c,-  =  (1  —  7)^  'T  7^ 
i  =  l,2,...,/. 

Let  Ti  =  9^,i  =  1,. . .  ,1,  and  let  rji]  <  . . .  <  denote  the  ordered  Xj.  The  9i  are  the 
treatment  effects  and  T[/]  is  associated  with  the  treatment  whose  effect  is  farthest  from  the 
average  of  all  treatment  effects  which  is  zero.  As  a  treatment  effect  goes  farther  from  zero, 
it  is  Scdd  to  become  more  significant.  We  want  to  test  Hq  :  ri  =  . . .  =  r/  =  0  (which  is 
equivalent  to  i^o  :  /3i  =  . . .  =  ;0/)  at  level  a.  When  Hq  is  false,  a  correct  decision  occurs 
if  Hq  is  rejected  and  a  subset  including  the  treatment  associated  with  rj/j  is  selected.  Let 
r'  =  [ri, . . . ,  T/].  We  require  that 

Pr{  Reject  ifo  In  =  0}  =  a  (3.2) 

and 

Inf  Pr{CD  \t[j]  >  Acr^}  =  P*  (3.3) 

where  7  <  P*  <  1  and  A  >  0  are  specified  in  advance. 

Since  Var(^i)  =  c,cr^,  it  can  be  estimated  by  Ci  ,  where  is  the  error  mean  square 
(MSE)  in  the  one-way  ANOVA.  It  is  known  that  follows  the  noncentral  Pij-/;a,- 

distribution  where  Aj  =  =  1,...,/,  and  J  =  X/  define  our  test  statistics 

SNi  by  SNi  =  10  =  1, . . . ,  /  where  the  log  is  to  base  10.  Our  test  procedure  is 

as  follows: 


Reject  Hq  if  5iV,-  >  C  for  some  i.  If  Hq  is  rejected,  then  include  in  the  selected  subset 
of  significant  treatments  all  those  treatments  for  which  SNi  >  C. 

The  constant  C  should  satisfy 

Pr{  Max^  SNi  >C\T  =  0}=a  (3.4) 

and 

Inf  Pr{SN^)  >  C\t[i]  >  Act^}  =  P*,  (3.5) 

where  A(/)  is  the  Aj  associated  with  . 

When  r  =  0,  the  SNi  are  i.i.d.  central  So,  using  the  arguments  employed  in 

Section  2,  equation  (3.4)  is  satisfied  if 

Pr{Pi,j_7<10^}  =  l- j.  (3.6) 

Now, 

Inf  Pr{5iV(/)  >  C\t[j]  >  Acr^} 

=  Inf  >  C|t[;|  > 

where  A(/)  is  the  A/  associated  with  r(j).  Using  the  stochastically  increasing  property  of 
the  noncentral  F  in  terms  of  the  noncentrality  parameter,  we  can  see  that  equation  (3.5) 
is  satisfied  if 

Pr{Pi,j-j;A,  <10^1  =  1 -P*  (3.7) 

where  Aj  =  : . 

:<»</ 

If  we  now  let7  =  l  —  j,A  =  l—  jj,  and  define  y,  ao,bo,  B,  B,  and  F  as  in  Section 
2  with  Ai  in  the  place  of  A,  then  an  approximate  solution  to  (3.6)  and  (3.7)  is  given  by 
(2.11)  and  (2.12).  Thus  the  solution  is  (2.15),  namely,  C  =  30  log  G. 

M  (XX  0  ^ 

Remark:  One  can  use  --'7/ — ^  as  a  choice  of  A  for  a  future  study.  When  SNi  is 
negative,  as  an  estimate  of  than  1;  this  usually  means  that  the 

estimator  Oi  of  9i  is  unstable. 
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4.  An  Example 


We  illustrate  the  one-way  ANOVA  test  procedure  of  Section  3  using  the  following 
example  of  Draper  and  Smith  (1981). 

An  experiment  was  conducted  using  three  treatment  levels,  namely,  0,  100,  and  200 
mgs  of  caffeine.  Thirty  healthy  male  college  students  of  the  same  age  and  with  essentially 
the  same  physical  ability  were  selected  and  trained  in  finger  tapping.  After  the  training 
was  completed,  ten  men  were  randomly  assigned  to  each  treatment  level.  Neither  the 
men  nor  the  physiologist  knew  which  treatment  the  men  received;  only  the  statistician 
knew  this.  Two  hours  after  the  treatment  was  administered,  the  number  of  finger  taps  per 
minute  was  recorded  for  each  man. 

Let  Yij  =  number  of  finger  taps  per  minute  of  the  jth  man  on  the  ith.  treatment, 

(jL  =  true  value  for  the  average  number  of  finger  taps  in  a  population  of  males  of  which  the 
selected  thirty  from  a  random  sample,  di  =  the  ith  treatment  effect,  that  is,  the  additive 
effect  of  the  ith  treatment  over  and  above  (or  below)  fx,  where  -b  ^2  +  ^3  =  0,  and  Sij 
=  the  random  effect  which  is  a  random  deviation  from  tJ,  +  Oi  taps  per  minute  for  the  jth. 
student  who  received  the  ith  treatment. 

With  the  above  definitions,  we  have  the  ANOVA  model; 


Yij  =  fx  Oi  Sij 


and  we  assume  that  the  Sij  are  iid  N{0,cr'^). 

Now,  0'  =[o,..., 0], r  =  2^1  =  i:,o:,o:],^=[oM', O'], ^  =  [O' 

(lx  10)  (lx  10) 

and  where  ^i  =  fj,  +  0i,i  =  1,2,3.  From  the  data  we  have: 


QM1, 


X!  =  [242  245  244  248  247  248  242  244  246  242  248  246  245  247  248 

(1X30) 

250  247  246  243  244  246  248  250  252  248  250  246  248  245  250], 


a:  =  [X,  X2  ^],/  =  3,  Ji  =  J2  =  J3  =  10. 

(30X3) 


The  regression  model  is:  Y_  =  X^  -{•  s. 
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From  the  output  of  SAS  program,  we  obtain  the  following; 


h  =  244.8,  62  =  246.4,  63  =  248.3,  5^  =  4.9667. 


From  the  results  in  Section  3,  we  obtain 

1 


Cl  =  C2  =  C3  = 


15’ 


=  -1.7,02  =  -0.1,  0z  =  1.8,  Ai ,  =  8.728,  A2  =  0.03,  A3  =  9.785; 
SNi  =  9.42384,  SN2  =  -15.815,  SN3  =  9.9203. 


Suppose  we  have  chosen  A  =  0.217  and  Ai  =  15A  =  3.25  based  on  the  past  experience 

Max 

(or  a  preliminary  sample  which  yielded  -  —  jf — ^  =  3.25).  Let  a  =  0.05  and  P*  =  0.90. 
Then  C  =  3.05149.  Since  SNi  >  C  and  SN3  >  C,  we  reject  Hq  :  =  ^2  =  ^3  =  0 

and  select  treatments  1  and  3  as  significant  (i.e.  sufficiently  away  from  the  average  effect). 
On  the  other  hand,  SN2  is  not  only  far  less  than  C,  it  is  negative.  We  conclude  that  this 
treatment  level  is  not  stable. 
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